Finding Good Conversations Online: The Yahoo News Annotated Comments Corpus

نویسندگان

  • Courtney Napoles
  • Joel R. Tetreault
  • Aasish Pappu
  • Enrica Rosato
  • Brian Provenzale
چکیده

This work presents a dataset and annotation scheme for the new task of identifying “good” conversations that occur online, which we call ERICs: Engaging, Respectful, and/or Informative Conversations. We develop a taxonomy to reflect features of entire threads and individual comments which we believe contribute to identifying ERICs; code a novel dataset of Yahoo News comment threads (2.4k threads and 10k comments) and 1k threads from the Internet Argument Corpus; and analyze the features characteristic of ERICs. This is one of the largest annotated corpora of online human dialogues, with the most detailed set of annotations. It will be valuable for identifying ERICs and other aspects of argumentation, dialogue, and discourse.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using New York Times Picks to Identify Constructive Comments

We examine the extent to which we are able to automatically identify constructive online comments. We build several classifiers using New York Times Picks as positive examples and non-constructive thread comments from the Yahoo News Annotated Comments Corpus as negative examples of constructive online comments. We evaluate these classifiers on a crowdannotated corpus containing 1,121 comments. ...

متن کامل

The SENSEI Annotated Corpus: Human Summaries of Reader Comment Conversations in On-line News

Researchers are beginning to explore how to generate summaries of extended argumentative conversations in social media, such as those found in reader comments in on-line news. To date, however, there has been little discussion of what these summaries should be like and a lack of humanauthored exemplars, quite likely because writing summaries of this kind of interchange is so difficult. In this ...

متن کامل

مطالعۀ الگوهای جمعیت‌شناختی و رفتاری خوانندگان برای اشاعۀ گزینشی اخبار

Purpose: The current research focuses on selective dissemination of news and aims at finding patterns for recognition of readers’ favorite news through web mining technique. Method: Data for this research was collected from the Yahoo News Website. The source of news was Associated Press. 840 news dated between 2011/3/1 and 2011/5/10 was analyzed through subject clustering technique. Findings:...

متن کامل

Automatically Identifying Good Conversations Online (Yes, They Do Exist!)

Online news platforms curate high-quality content for their readers and, in many cases, users can post comments in response. While comment threads routinely contain unproductive banter, insults, or users “shouting” over each other, there are often good discussions buried among the noise. In this paper, we define a new task of identifying “good” conversations, which we call ERICs—Engaging, Respe...

متن کامل

Multilevel Annotation of Agreement and Disagreement in Italian News Blogs

In this paper, we present a corpus of news blog conversations in Italian annotated with gold standard agreement/disagreement relations at message and sentence levels. This is the first resource of this kind in Italian. From the analysis of ADRs at the two levels emerged that agreement annotated at message level is consistent and generally reflected at sentence level, and that the structure of d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017